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ABSTRACT 

It  is  evident  that  military  applications  for  21st  century >  will  be  highly  complex,  multivariable  systems. 
Designing  optimal  controllers  for  such  applications  will  require  the  use  of  mathematical  models  which 
describe  the  complexity >  of  the  underlying  processes  as  accurately  as  possible.  For  robustness  it  is 
essential  that  these  models  include  the  uncertainties  in  the  estimated  process  dynamics  and  trajectories. 
Controllers  resulting  from  optimal  control  strategies  (like  LQG,  H2,  Hf  will  usually  be  of  very  high  order 
and  that  can  cause  implementation  and  computational  problems.  Some  of  these  problems  can  be  overcome 
by  using  controllers  that  are  of  lower  order  and  restricted  structure. 

The  subject  of  restricted  structure  controller  design  and  performance  assessment  is  relatively  new  and 
enables  the  expected  performance  to  be  assessed  against  a  much  more  realistic  criterion.  That  is,  the 
performance  figures  take  into  account  the  limitations  of  the  existing  control  system  structure,  and  hence 
provide  a  more  accurate  measure  of  the  possible  performance  improvement. 

However,  the  design  and/or  tuning  of  restricted-structure  controllers  in  order  to  provide  the  performance 
comparable  with  full-order  solutions  is  still  a  veiy  contentious  issue.  Added  to  this  is  the  need  to  achieve 
robust  properties  and  performance  specifications  required  by  military  applications.  Very  few  methods  for- 
designing  restricted-structure  controllers  exist  that  allow  the  performance  and  robustness  objectives  to  be 
combined  into  one  relatively  simple  optimisation  problem.  This  lecture  presents  an  LQG/H2-based 
method  that  tackles  the  above  mentioned  issues. 

This  work  also  provides  the  exciting  possibility  of  enabling  the  multivariable  structures  of  systems  to  be 
assessed.  Thus  for  example,  it  is  possible  to  check  whether  a  diagonal  multivariable  controller,  upper 
triangular,  lower  triangular  or  sparser  structures  might  be  almost  as  good  as  a  full  multivariable  control 
law.  This  technique  provides  advantages  over  the  commonly  used  technique  of  so  called  relative  gain 
array  forjudging  the  best  structure  for  a  multivariable  system. 

All  these  techniques  are  applicable  to  continuous  or  discrete-time  linear  systems,  although  the  nonlinear 
behaviour  can  readily  be  accommodated  in  the  design  through  the  use  of  multiple-model  strategy.  This 
lecture,  however,  will  also  introduce  a  control  law  that  is  derived  for  the  control  of  nonlinear,  possibly 
time-varying  systems.  The  solution  for  this  Nonlinear  Generalized  Minimum  Variance  control  law  is 
original  and  was  obtained  using  a  simple  operator  representation  of  the  process.  The  quadratic  cost 
index  involves  both  error  and  control  signal  costing  terms.  The  controller  obtained  is  simple  to  implement 
and  includes  an  internal  model  of  the  process.  In  one  form  it  might  be  considered  a  nonlinear  version  of 
the  Smith  Predictor. 


1.0  INTRODUCTION 

The  design  and  performance  assessment  of  optimal  controllers  (Kwakernaak  and  Sivan,  1972  [1], 
Desborough  and  Harris  1993  [2],  Uduehi  and  Grimble  2001  [3])  is  considered  as  well  as  the  ways  in 
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which  robustness  properties  can  be  modified  and  assessed.  The  minimization  of  quadratic  cost  functions 
for  stochastic  linear  systems  in  such  a  way  that  the  robustness  margins  are  improved  is  of  interest.  There 
are  of  course  well  known  guaranteed  robustness  properties  for  systems  with  state  feedback  (Anderson  and 
Moore  1971  [4],  Safonov  and  Athans  1986  [5],  Doyle  1978  [6])  and  in  some  problems  these  properties 
can  be  recovered  using  a  loop  transfer  recovery  approach  ([7], [8], [9], [10], [1 1], [12]).  However,  more 
direct  methods  of  tuning  robustness  properties  would  be  valuable  (Doyle  and  Stein,  1981  [13]).  This  is 
explored  for  both  full-order  optimal  output  feedback  controllers  and  for  low  order  (restricted  structure) 
optimal  solutions. 

Two  aspects  of  the  design  of  full-order  and  restricted  structure  ( RS)  optimal  controllers  are  considered  in 
the  following.  The  first  involves  robustness  improvement  and  the  second  is  concerned  with  the  noise 
rejection  properties.  An  LQG  criterion  is  to  be  optimized  but  the  usual  robustness  and  noise  rejection 
properties  will  not  hold  if  the  controller  structure  is  limited  to  say  a  PID  or  a  low  order  lead-lag  form. 
Grimble  (1999  [14],  2000  [15])  introduced  a  polynomial  systems  approach  to  restricted  structure  optimal 
control  design  and  this  is  the  philosophy  followed  here.  However,  the  design  of  such  controllers 
previously  focussed  on  performance  issues  and  the  robustness/noise-rejection  aspects  were  not  considered 
in  any  detail.  The  strategy  for  robustness  improvement  is  to  add  a  fictitious  signal  and  a  sensitivity  costing 
term  in  the  criterion.  This  enables  the  penalty  on  sensitivity  to  be  directed  at  modifying  the  robustness 
properties.  The  normal  LQG  cost-index  does  of  course  include  error  and  control  signals  that  depend  upon 
the  sensitivity  functions.  However,  this  does  not  enable  these  sensitivity  terms  to  be  costed  in  a  particular 
way.  The  proposed  robustness  weighting  term  gives  free  choice  of  the  weighting  function  and  enables  the 
H2  norm  of  a  weighted  sensitivity  function  to  be  minimized.  This  relates  to  the  definition  of  a  so-called 

Dual  Criterion  (Grimble,  1986  [16]),  but  the  results  here  are  focussed  on  the  design  issues  and  they  use 
what  might  be  termed  a  Kucera  polynomial  systems  approach,  (1980  [17]).  The  frequency-domain 
polynomial  systems  approach  is  particularly  helpful  when  determining  the  frequency  response  behaviour 
to  noise  and  disturbance  signals  (Grimble,  1994  [18]). 

The  above  control  algorithms  and  methodology  are  based  on  the  linearized  description  of  the  process  and 
are  valid  in  a  particular  operating  point.  The  aim  in  the  following  is  to  introduce  a  controller  for  nonlinear 
multivariable,  possibly  time-varying,  processes.  This  Nonlinear  GMV  (NGMV)  control  law  is  related  to 
the  family  of  LQG  designs  but  is  based  more  on  the  rich  heritage  of  minimum  variance  controls.  Astrom 
introduced  the  Minimum  Variance  (MV)  controller  assuming  the  linear  plant  was  minimum  phase  and 
later  derived  the  MV  controller  for  processes  that  could  be  non-minimum  phase  (Astrom  1979  [19]).  The 
latter  was  guaranteed  to  be  stable  on  non-minimum  phase  processes,  whereas  the  former  was  unstable. 
Hastings-James  (1970  [20])  and  later  Clarke  and  Hastings-James  (1971,  [21]),  modified  the  first  of  these 
control  laws  by  adding  a  control  costing  term.  This  was  termed  a  Generalized  Minimum  Variance  (GMV) 
control  law  and  enabled  non-minimum  phase  processes  to  be  stabilized,  although  when  the  control 
weighting  tended  to  zero  the  control  law  reverted  to  the  initial  algorithm  of  Astrom,  which  was  unstable. 
However,  the  control  law  had  similar  characteristics  to  LQG  design  in  some  cases  and  was  much  simpler 
to  implement.  This  simplicity  was  exploited  very  successfully  in  the  so-called  generalized  MV  self-tuning 
controller  introduced  by  Clarke  and  Gawthrop  (1975,  [22]).  The  control  of  nonlinear  non-minimum  phase 
linear  systems  using  a  GMV  type  algorithm  was  considered  by  Grimble  (1981  [23]).  The  use  of  GMV 
control  laws  for  linear  systems  designs  was  reviewed  in  Grimble  (1988  [24]).  The  use  of  dynamic  cost 
weightings  in  the  GMV  cost  index  (1994  [18])  provided  additional  flexibility  and  the  dynamic  costing 
solution  was  exploited  to  obtain  a  Generalized  Hx  controller  (Grimble  1993  [25]).  All  of  these  results 
were  applicable  to  linear  discrete-time  stochastic  processes. 

The  NGMV  control  law  uses  a  similar  stochastic  framework  but  generalizes  and  extends  these  results  to 
account  for  nonlinear  system  behaviour.  The  structure  of  the  system  was  defined  so  that  a  simple 
controller  structure  and  solution  are  obtained.  When  the  system  is  linear  the  results  revert  to  those  for  the 
GMV  controller  referred  to  above  (Grimble  2001  [26]).  There  is  some  loss  of  generality  in  assuming  the 
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reference  and  disturbance  models  are  represented  by  linear  subsystems.  However,  the  plant  model  can  be 
in  a  very  general  nonlinear  operator  form,  which  might  involve  state-space,  transfer  operators  or  even 
nonlinear  function  look  up  tables.  That  is,  the  input  sub-system  to  the  plant  might  include  valves  or  a 
servo-system  that  has  no  traditional  equation  based  model.  The  input  nonlinear  subsystem  can  be  a  black 
box.  No  state  space  or  operator  structure  is  needed.  The  optimal  solution  reveals  all  that  is  needed  is  a 
method  of  computing  the  output  from  such  subsystem  given  a  control  input.  If  on  the  other  hand  an 
equation  based  model  is  available  it  may  be  used  directly.  For  this  reason  the  nonlinear  part  of  the  plant  is 
represented  in  operator  (unstructured  form)  for  most  of  the  analysis.  The  nonlinear  dynamic  terms,  in  the 
plant,  only  need  to  be  open  loop  stable  and  can  be  very  complicated.  The  ability  to  introduce  very  general 
plant  structures,  without  formal  models,  is  a  major  advantage  of  the  method. 

For  linear  systems  stability  is  ensured  when  the  combination  of  a  control  weighting  function  and  an  error 
weighted  plant  model  is  strictly  minimum  phase.  For  nonlinear  systems  a  related  operator  equation  must 
have  a  stable  inverse.  It  is  shown  that  if  there  exists  say  a  PID  controller  that  will  stabilize  the  nonlinear 
system,  without  transport  delay  elements,  then  a  set  of  cost  weightings  can  easily  be  defined  to  guarantee 
the  existence  of  this  inverse  and  thereby  ensure  the  stability  of  the  closed  loop.  If  the  plant  is  open-loop 
stable  the  solution  can  be  realized  in  a  particularly  simple  form  which  relates  to  the  well  known  Smith 
Predictor  for  systems  with  significant  transport  delays.  This  has  the  advantage  of  providing  some 
confidence  in  the  practical  utility  of  the  solution  and  also  introduces  what  might  be  termed  an  extension  of 
these  Smith  controllers  for  nonlinear  plants.  A  so-called  Nonlinear  Smith  Predictor  will  therefore  be 
introduced.  The  main  advantage  over  other  nonlinear  control  design  methods  (Isidori  1995  [27])  is  the 
simplicity  of  the  solution. 


2.0  SYSTEM  MODEL 

The  system  shown  in  Fig.  1  is  for  the  time  being  assumed  to  be  linear,  continuous-time  and  single-input, 
single-output.  The  external  white  noise  sources  drive  colouring  filters  which  represent  the  reference 
Wr(s),  measurement  noise  Wn(s),  robustness  modification  Wp(s)  and  disturbance  Wd(s)  subsystems.  The 
robustness  model  does  not  exist  physically  but  is  introduced  for  design  modification.  The  system 


equations  become: 

Input  disturbance:  d(s)-Wd(s )t( s)  (2.1) 

Robustness  signal:  p(s)  =  Wp(s)rj(s)  (2.2) 

Output:  y(s)  =  d(s)  +  p(s)  +  W(s)u(s)  (2.3) 

Reference:  r(s  )-Wr(s  )CJ  s  )  (2.4) 

Tracking  error:  e(s)  =  r(s)  -  y(s)  (2.5) 

Observations:  z(s)  =  y(.v)  +  n(s)  (2.6) 

Measurement  noise:  n(s)-Wn{s)co{s )  (2.7) 

Control  signal:  u(s)  -  C0(s)(r(s)  -  z(s))  (2.8) 


The  system  transfer  functions  are  all  assumed  to  be  functions  of  the  Laplace  transform  complex  number  in 
the  complex  frequency  domain.  For  notational  simplicity  the  arguments  in  W(s)  and  the  other  models  are 
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omitted. 


Figure  1:  Single  Degree  of  Freedom  Unity  Feedback  Control  System  with 
Measurement  Noise,  Reference  and  Disturbance  Models  Assumptions 

Assumptions: 

•  The  white  noise  sources,  $  co,  //  and  C  are  zero-mean  and  mutually  statistically  independent.  The 
intensities  of  these  signals  are  without  loss  of  generality  taken  to  be  of  value  unity. 

•  The  system  W  is  assumed  free  of  unstable  hidden  modes  and  the  reference  Wr ,  noise  Wn  ,  robustness 
Wp  and  disturbance  Wd  subsystems  are  asymptotically  stable. 

The  following  expressions  may  easily  be  derived  for  the  output,  error,  observations,  controller  input, 


control  and  sensitivity  costing  signals: 

Output :  y(s)  =  WC0(  1  +  WCoy\r(s )  -  n(s))  +  (1  +  WC0)~\d(s )  +  p(s))  (2.9) 

Error  :  e(s)  =  r(s)-y(s)  =  (1  +  WC0)~\r(s )  -  d(s)  -  p(s))  +  WC0(  1  +  WC^n^s)  (2.10) 

Observations:  z(s)  =  WC0(l  +  WC0)~lr(s)  +  (1  +  WC0)~l  (d(s)  +  n(s)  +  p(s))  (2.1 1) 

Controller  input :  ^o(5)  =  r(s)  ~z(s)  =  (1  +  WC0)~1(r(s)-d(s)-n(s)-  p(s))  (2.12) 

Control  signal:  u(s)  -  (1  +  WC0  )-1  C0  (r(s)  -  d(s)  -  n(s)  -  /»(«))  (2. 1 3) 

Robustness  signal:  h(s)  =  (l  +  WC0)~l p(s)  (2.14) 

These  equations  include  the  following  sensitivity  operators: 
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Sensitivity: 

s=(i+wc0r1 

(2.15) 

Complementary  sensitivity  : 

T  =  l-S  =  WC0S 

(2.16) 

Control  sensitivity: 

m  =  c0s  =  c0  (i  +  wc0  y1 

(2.17) 

The  system  shown  in  Fig.  1  may  be  represented  in  polynomial  form  ( Kailath  1980  [28]),  where  the  system 
transfer  functions  are  written  as: 

System  : 

w=  a~!b 

(2.18) 

Reference  generator: 

"->1 

1 

II 

(2.19) 

Input  disturbance: 

0s 

1 

ii 

£ 

(2.20) 

Measurement  noise: 

K=A~1cn 

(2.21) 

Robustness  signal  model: 

Wp  =  A~XC p 

(2.22) 

There  is  no  loss  of  generality  in  assuming  these  models  have  a  common  denominator  A  polynomial.  The 
various  polynomials  are  not  necessarily  coprime  but  the  system  transfer  function  is  assumed  to  be  free  of 
unstable  hidden  modes.  The  coprime  representation  of  the  system  is  denoted  by  A()  1  B0 ,  where  B=BoUo 
and  A=A  nil 0. 

The  spectrum  of  the  signal  r(s)-d(s)-n(s)-p(s)  in  equations  (2.12)  and  (2.13)  is  denoted  by  fj(s)  and  a 
generalised  spectral-factor  IT  may  be  defined  from  this  spectrum,  using: 

YfYf  =  +  ®pp  (2.23) 

In  polynomial  form  Yj  =  A  1  Df  .  The  disturbance  model  is  assumed  to  be  such  that  ZL  is  strictly  Hurwitz 
and  satisfies: 


Dj  Df  -  EE  +  CdCd  +  CnCn  +  CpCp  (2.24) 

The  role  of  the  robustness  model  Wp  may  now  be  explained  since  it  is  one  of  the  components  in  the 
combined  signal  spectrum  ®ff  defined  in  (2.23).  Clearly,  if  ®pp  is  dominant  then  the  spectrum 

®ff  — >  ®pp .  This  implies  that  ®e„e0  ~ 5 >  S® ppS*  and  this  spectrum  will  provide  the  robustness  term 
(suitably  weighted)  introduced  in  the  cost  function,  in  the  next  section. 

3.0  LQG  CRITERION  AND  RESTRICTED  STRUCTURE  CONTROL 

The  LQG  cost-function  to  be  minimized  ( Youla  et  al  1976  [29])  is  defined  as: 
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J  = 


{QM^eeis)  +  Rc(s)®uu(s)  +  Pc(S)®e0e0  (*)}* 


(3-D 


where  Qc,  Rc,  Pc  represent  dynamic  weighting  elements,  acting  on  the  spectra  of  the  error  eft),  feedback 
control  u(t)  and  controller  input  e0(t)  signals.  The  Rc  weighting  term  is  assumed  to  be  positive  definite 
and  Qc  ,  Pc  are  assumed  to  be  positive-semidefmite  on  the  D  contour  of  the  s-plane.  The  robustness 
weighting  term  can  be  motivated  as  in  the  dual  criterion  results  of  Grimble  (1986  [30]).  The  error,  control 
and  robustness  weightings  can  be  written  in  polynomial  form  as: 


BqBq 

A*qAq 


R.  =- 


R„ 


B.B,. 


A,.  A,.  A,.  A,. 


and 


P  =  - 


AqAq 


BpBp 

A*qAq 


(3.2) 


where  Aq  is  a  Hurwitz  polynomial  and  A r  is  a  strictly  Hurwitz  polynomial  ( Grimble  and  Johnson  1988 
[31]).  The  problem  will  be  to  minimize  the  above  criterion  with  the  controller  chosen  to  have  a  specified 
structure  [15].  The  Pc  weighting  term  represents  a  robustness  weighting  to  be  explained  later.  For  later  use 
let  the  weightings  be  rewritten  using  the  common  denominator,  so  that: 


Qc=Qc+Pc  =  Qcn+F: '?n  =%or  Qc=  Qcn  !{AqA*qAX)  =  Qcn  /( 4 X) 


AqAq  AqAq 


where  Qcn=QcnArA,.  and  Aw=AAr  .  Similarly  for  the  control  weighting: 


Rc=-^X  =  Ron  KaXaX  )  =  Rcn  haX) 


ArAr 


where  Rcn  =  RcnAqAq 


Theorem  3.1:  Restricted  Structure  Single  Degree  of  Freedom  LQG  Control  Problem 

Consider  the  LQG  error  and  control  weighted  criterion  defined  in  (3.1),  and  the  system  introduced  in  §2. 
The  conditions  that  determine  the  LQG  controller  of  restricted  structure  are  derived  below.  The  derivation 
includes  the  robustness/sensitivity  costing  and  coloured  measurement  noise  model.  These  terms  were  not 
considered  in  the  previous  polynomial  approaches  to  the  restricted  structure  control  design  problem.  The 
cost-function  to  be  minimised  was  defined  in  as, 

J  =  {Qc®ee  +  Rc®uu  +  e0e0 

2.TC ]  D 

Noting  the  independence  of  the  noise  sources  and  recalling  (2.10)  to  (2.13)  and  the  definitions  for 
sensitivity  in  equations  (2.15)  to  (2.17)  obtain  by  substituting  in  (3.1)  : 

1  \Qc(\-WM)<b  ff(\-W*M*)  +  Pc0--WM)<b 

J  = - \ds 

2xjD[+RcMQ>ffM*-Qc(\-WM)<S>nn -$„„(! -MV*)&  +QcOm\ 
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y  =  QcW  +  RcW®ffM  -M  W 

2xj  b  \-(QX  ff  -  Qc®nn)WM  +  QcQfr-  Qc<$>nn 


(3.3) 


where  Qc  -  Qc  +  Pc  and  this  may  be  written  in  the  alternative  forms  Q,  -  ^c'\  -  @cn  +  or  using  a 


AqAq  AqAq 


common  denominator  for  the  spectral  factor: 


Qc  =  Qcn  !(AAAX)  =  Qcn  JA’X) 


where  Qcn  =  Q  ArAr .  Then  define, 


®jp=W  (Qc®ff-Qc®nn) 


■c*  ff  nn> 


(3.4) 

(3.5) 


and  ®  =  ®  +0.^+0  +0  . 

jj  rr  da  nn  pp 


The  generalised  spectral  factors  Yc  and  Yf  due  to  Shaked  (1976  [32])  may  now  be  defined,  using: 


YCYC  =W  QCW  +  Rc 


YfYf  =®  =  ®  +® 

j  j  JJ  rr  dd  nn  pp 


(3.6) 

(3.7) 


Completing  the  squares  in  equation  (3.3)  obtain: 


J  =  ^~A  {(YcMYf--^)(YcMYf--^)*  +  %}ds 
2xjD  YcYf  YcYf 


(3.8) 


where 


®0  -  Qc®  ff  Qc® nn 


®  r  ®  , 

fp  fp 

s|«  + 

YJJfYf 


(3.9) 


Substituting  in  the  spectral-factor  expressions  (3.6)  and  (3.7),  using  the  polynomial  system  models  in 
equations  (2.18)  to  (2.22),  obtain 


YfY}  =  (EE*  +  CdC*  +  C„C*  +  CpC*p)/(AA*) 


p  pJ 

YX  =  (BfQcnB  +  A"  RcnA)  /( A*  A*WAWA) 

Qcn=(Qcn  +  Pcn)Ar4  and  Rcn  =  Rc„AqA* 


where 


(3.10) 

(3.11) 

(3.12) 


To  obtain  the  polynomial  spectral-factors  define  the  filter  Dj  and  control  Dc  spectral  factors  from  (3.6) 
and  (3.7)  respectively,  as: 


DfD*f  =  EE*  +  CdC*d  +  CnC*n  +  C  pCp 
D>c=B*QcnB  +  A*RcnA 


(3.13) 

(3.14) 


Recalling  that  A  and  Ar  are  normally  chosen  to  be  coprime,  the  generalized  spectral  factors  may  be 
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written  in  the  form: 

Yj-=A~iDj-  and  YC  =  A~IDC  where  AC=AAW  and  Aw=AqAr 

The  various  terms  in  the  criterion  (3.1)  may  now  be  simplified  by  substituting  from  the  polynomial  system 
models  in  and  the  spectral  factor  results  given  above. 


® jp=W  (Qc® ff  - Qc® 


B  .QcnDfDf  QcnCnCn 


ff 


A 


*  * 
AqAqAA 


) 


O f>/(YcYf)  =  B  ( 


QcnDfDf 

AA„ 


Qcn^n^ny  A- 


AA„ 


D„D, 


(3.15) 


The  measurement  noise  subsystem  must  be  asymptotically  stable  and  after  cancellation  of  common  terms 
may  be  written  as:  A~xCn  =  A~qCii0  ,  where  A  =  An0  A() .  The  following  diophantine  equations  must  be 
introduced: 

Feedback  Diophantine  equations:  Calculate  ( G0 ,  H0,  F0 )  with  F0  of  minimum  degree: 

DlG0+F0AAq=B*Qcn4Df  (3.16) 

D*H0  - F0BAr  =  A*RcnA*Df  (3.17) 

Implied  equation:  Multiplying  (3.16)  by  BAr  and  (3.17)  by  AAq  and  adding  the  equations  obtain: 

D*c(G0BAr  +  H{)AAq)  =  (BQcnB  +  A*  Rcn  A)  Df 


•  •  •  ^  . 
and  after  division  by  Dc  obtain: 


G0BAr  +  H0AAq  -  DcDj 


Measurement  noise  equation: 

o  +  Y0An0Aq  =  B  QcnCn0CnAr 


(3.18) 


(3.19) 


The  terms  in  the  cost-optimization  problem  may  now  be  considered,  substituting  from  the  above 
polynomial  equations.  Substituting  from  the  Diophantine  equations  (3.16)  and  (3.17): 


0 


fp 


0 


Y„  Y  f 


+  4)-(^- 

H  Dc  yAn0Aq 


D„Df 


Considering  now  the  term  YcMYj- ,  writing  C0  -  C01dC0n ,  obtain  : 


(3.20) 
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YcMYf  = 


DcDfC0n 


AAw(AC0d  +  BC0n) 


(3.21) 


The  first  squared  term  in  (3.8),  using  (3.21)  now  becomes: 


®jp  _  [DcDfCon  ~(Gq  ~ X0A0)Ar(AC0d  +BC0/I)]  ( Y0-F0Df ) 

c  f  r(y;  AAw(Ac0d + bc0ii)  d:d : 


Substituting  from  the  implied  Diophantine  equation: 


v  ®fP  [W0An0Aq  +X0ArB)C0n  ~(G0An0  -  X0A)A,.C0d]  (Y0  -F0Df) 

YcMYf~^Sr  = - - - +  ■ 


YcYf 


^noAiX^Cod  +  BC()n) 

This  cost  term  expression  may  be  written  in  the  form: 


DcDf 


YcMYf-^&  =  Ti+Ti 

YCYf 


(3.22) 


(3.23) 


(3.24) 


where  the  term  within  the  square  brackets  in  (3.23)  denoted  by  7}+ .  This  term  is  stable,  since  Aw  is 
Hurwitz  and  the  closed-loop  characteristic  polynomial  pc  =(AC0d  +  BC0n)  is  required  to  be  strictly 

#stc.. 

Hurwitz  for  Jmin  <  co  .  The  final  term  in  (3.23)  is  strictly  unstable  since  Dc  is  strictly  non-Hurwitz. 

4.0  COST  FUNCTION  MINIMIZATION  AND  PARAMETRIC  OPTIMIZATION 

Given  the  simplification  of  terms  in  the  cost-function  presented  above  the  cost  minimization  procedure 
may  be  followed  ( Grimble  and  Johnson,  1988  [31]).  Note  that  the  cost-  function  (3.8)  may  be  written, 
using  (3.24)  as: 


J  =  ^-A  {(T{  +  TfXT;  +  7}-/  +  O 0}ds  (4.1) 

2nj  D 

From  the  Residue  theorem  the  integrals  of  the  cross-terms  7’/  Tf  ,  7}~7}+  can  be  shown  to  be  zero.  This 
result  follows  because  <j)  7[ !  7]  ds  =  —  (|;  T]  1  ds  but  the  term  T~T**  is  analytic  for  all  s  in  the  left  half 

plane  so  that  the  sum  of  the  residues  obtained  in  calculating  (j)  7]~7]+  ds  is  zero. 

Note  that  this  result  still  applies  if  the  function  7]-7]+*  contains  poles  on  the  ja>  axis,  since  they  can  be 
avoided  by  the  D  contour,  using  small  semi-circular  detours  in  the  left-half  plane.  These  semi-circles  are 
centred  on  these  poles  and  do  not  contribute  in  the  limiting  case  as  the  radius  tends  to  zero.  Also  observe 
that  the  term  containing  7,1~7]+  could  lead  to  an  infinite  cost  should  such  terms  be  present.  However, 
these  may  not  be  present,  since  the  optimal  control  may  be  chosen  so  that  they  cancel.  The  practical  case 
when  this  arises  is  when  the  error  weighting  includes  an  integrator  Aq(s)  =  s.  When  the  controller 
denominator  C0d(s)  includes  integral  action  the  A  polynomial  cancels  throughout  the  term.  The 
consequence  is  that  the  criterion  can  have  a  finite  minimum,  even  though  certain  cost  function  terms 
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include  j  axis  poles.  The  cost-function  therefore  simplifies  as: 


J  =  ^~$  {(T+Tf*  +  TfTf*)  +  (t>0}ds 
2nj  D 


(4.2) 


Since  the  terms  77  and  ®0  are  independent  of  the  controller,  the  criterion  J  is  minimised  when  the  first 
term  involving  7)'  is  minimized.  However,  if  the  feedback  controller  C0  has  a  restricted  structure  then  it 
is  unlikely  that  7}'  can  be  set  to  zero.  It  follows  that  to  minimize  the  cost-function  the  first  term  in  (4.2) 

should  be  minimised,  through  the  choice  of  C0,  namely:  J0  =  — - — (£  {T1+Tl+*}ds  (4.3) 

2  nj  l 


For  a  finite  solution  to  this  cost  minimization  problem  to  exist  the  7}+  term  must  be  asymptotically  stable. 

[(H0An0Aq+X0ArB)C0n  ~(G0An0-X0A)A,.C0d] 


Inspection  of  this  term: 


r=- 


Aio  AA-ACod  +bc0ii ) 


(4.4) 


reveals  that  all  terms  are  asymptotically  stable  but  the  weighting  A  could  include  a  j  axis  zero  (A  is  only 

assumed  to  be  Hurwitz).  However,  it  is  assumed  that  although  the  structure  of  the  controller  C0  =  C0nC0d 
is  limited,  Cnd  will  have  zeros  at  the  j  axis  zeros  of  the  chosen  weighting  Aq.  Thus,  such  a  zero  will  cancel 

and  under  the  given  assumptions  7}+  is  asymptotically  stable. 

Then  the  LQG  controller  of  restricted  structure  may  be  calculated  from  a  simple  direct  optimization 
problem.  First  compute  the  filtering  and  control  spectral  factors  and  Dc  (strictly  Hurwitz  due  to  the 
system  description)  using: 


DfD}  =  EE*  +  CdC*d  +  C„C,;  +  CpC*p  (4.5) 

D*DC  =  B*QcnB  +  A*RcnA  (4.6) 

The  following  regulating  Diophantine  equations  must  then  be  solved  for  (G0,  H0,  F0),  with  F0  of 
minimum  degree: 


D*G0  +  F()AAq  =  B*QcnA*rDf  (4.7) 

D*H0  -F0BAr  =  A*RcnA*Df  (4.8) 

and  the  following  measurement  noise  Diophantine  equation  must  be  solved  for  (Xfj,  Y0),  with  Y0  of 
smallest  degree:  D*D}xo  +  Y0An0Aq  =  B*QcnCn0C*nA*  (4.9) 

The  optimal  controller  C0  =  C()nCdd  must  then  be  found  to  minimize  the  following  component  in  the  cost- 

1  r“ 

function  term:  J0  = —  {T^ (jco)^ (—jco)}dco  (4.10) 

2n 
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,  T+  [(# oA oAq  +  X0ArB)C0n  -  (G0An0  -  X0A)ArC0d  ] 

where  T,  - - - - 

AiO -AqA-i-ACod  +^Q„) 

If  the  controller  has  a  specified  limited  structure  the  minimum  of  the  cost  term  J0min  will  be  non-zero.  For 
an  unconstrained  solution  the  minimum  is  achieved  when  7/  =  0  and  the  minimum  of  J0  (denoted  J0min) 
is  zero.  It  has  been  explained  in  Theorem  3.1,  that  the  computation  of  the  optimal  feedback  controller  Cg 
reduces  to  minimization  of  the  term  J0. 

It  is  clear  from  (4.4)  that  7}+  can  be  written  in  the  form:  7}+  =  (C0nL,  -  C0dL2)/  ( C0nL3  +  C0dL4 )  (4.1 1) 
where  C0  -  C0n  /  C0d  has  a  specified  structure  which  can  be  as  expressed  below; 


Reduced  order: 


C0(s)  = 


cn0  +CnlS+...+CnpSP 
Cd0  +Cd]S+...+CdvSV 


where  v  >  p  is  less  than  the  order  of  the  system  (plus  weightings) 


Lead  lag:  C0(s)=  (c"0  +  cms)(cn2  +  c„3°) 

(cd0  +cdis)(cd2+cd3s) 

PID:  C0(s)-k0+kj  /  s  +  k2s 

The  assumption  must  be  made  that  a  stabilising  control  law  exists  for  the  assumed  controller  structure. 
Note  that  the  controller  structure  should  be  consistent  with  the  choice  of  error  weighting,  in  the  sense  that, 
if  Aq  includes  a  j  axis  zero,  then  the  controller  denominator  Cggfs)  should  also  include  such  a  zero.  The 
solution  of  this  optimisation  problem  may  be  obtained  using  the  following  results.  Assume,  for  example, 
that  Cq  has  a  modified  PID  structure  of  the  form: 


C0  -  k0  +(kj  /  s)  +  (k2s  /  (1  +  sx)) 

(4.12) 

so  that  the  numerator  : 

C0n  -  k0(l  +  sx)s  +  k1(l  +  sx)  +  k2s2 

(4-13) 

and  the  denominator  : 

Cod=s(l  +  sx) 

(4-14) 

Let  the  superscripts  r  and  i  denote  the  real  and  imaginary  parts  of  a  complex  function,  so  that 
C0n  =  C\)n  +  jC‘0n  and  C0d  =  C'0d  +  jC‘0d  .  The  controller  numerator  term  may  be  split  into  frequency 
dependent  components,  through  comparison  with  (4.13): 

C0n( j(o )  =  -k0(o  x  +  kd  -  k2(o  +j(k0(o  +  k^x)  (4.15) 

and  Cqu( jm)  =  -k0oAx  +  kd  -  k2m2  and  C,0n(ja))-k0a)  +  kIG)x  (4.16) 

Similarly,  for  the  denominator  term:  C0d  ( jm)  =  -co  x  +  jm  (4. 1 7) 
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and  hence 


C'od  (j&)  =  -ro2?  and  Cl0d  (jco)  =  ro 


(4.18) 


If  the  solution  of  the  optimization  problem  is  to  be  found  by  iteration,  the  denominator  term  in  7}+  can  be 
assumed  to  be  known  and  the  minimisation  can  then  be  performed  on  the  numerator  (linear  terms).  Thus, 
to  set  up  this  problem  let,  7}+  =  C0nLnl  -  C0dLn2 


where 


kni  -  Li  /  ( C0t 


L3  +  Ci 


l4) 


0d^4 


and 


kn2  ~  L2  /  ( C0nL3  +  C, 


l4) 


0d^4 


(4.19) 


Substituting  from  (4.1 1)  and  (4.19). 


t;  =  c:  u  c:,n , + cLii ,  +  uc*  n ,  +  c:  u ,  -  c:,f ,  -  c' zr , ) 

1  On  n  1  0«  n\  0 a  n2  0 a  n2  J  \  On  nl  On  n  1  0 d  n 2  0 a  n2s 


and  after  substitution  from  (4.16)  and  (4.18)  obtain 

K  +  j(pLrm  -  m-tLJ)  +  kx  {Lrnl  -  cQTlinX  +  j{corEnX  +  L‘j) 


T  - 
1\  ~ 


+k2  (~o)2nnl  -  jorknl )  +  orzLn2  +  coLn2  +  j(orzLn2  -  coLrn2 ) 


The  real  and  imaginary  part  of  7}+  may  therefore  be  written  as:  7)'  =  T/r  +  jTj'1  and  it  follows  that, 

2  /  ,  \2  /  ,.\2 


ll 


=  (T'/  r  ]j  +  (t)1'  j  Write  a  vector  form  of  the  above  equations  as: 


=  F 


ki 

k2 


-L-Fx-L  . 


where 


F(  m)  = 


-co  ("rot  Lrnl  +  L‘nl) 

(£>( Lrnl  ~(£>xFnl) 


(L'nl-03xL‘nl) 
(coxL'nl  +  L‘nl) 


-C02Fni 

ji 

ro  Lnj 


and  L((x>)  = 


—(£>(  Fn  2  +  rot L’n2) 

r  2  i 

(»Ln2  ~  w~  xLn2 


The  cost- function  can  be  optimised  directly  but  a  simple  iterative  solution  can  be  obtained  if  the  integral  is 
approximated  ( Yukitomo  et  al  1998  [33])  by  a  summation  with  a  sufficient  number  of  frequency  points 
{(0j,(02  (0N }  .  The  optimisation  can  then  be  performed  by  minimising  the  sum  of  squares  at  each  of 
the  frequency  points.  The  minimization  of  the  cost  term  Jq  is  therefore  required  where, 


J0=i  (Fx  ~L)T(Fx-L)  =  (b-Ax  )T(b  -  Ax )  (4.20) 

k=l 


where 

A  = 

,  b  = 

L(®i) 

,  x  = 

ko 

ki 

_  F(  or  (V  )_ 

_L(a>N)_ 

k2_ 

Assuming  the  matrix  AT A  is  non-singular  the  least  squares  optimal  solution  ( Noble  1969  [34])  follows  as: 
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x  =  (Ar  A)”1  Arb  (4.22) 

Lemma  4.1  :  Restricted  Structure  LQG  Controller  Solution  Properties 

The  characteristic  polynomial  which  determines  stability  and  the  implied  equation  are  given  as: 

Pc  =  AC0d  +  BC0n 

AAqH0+BArG0=DcDf  (4.23) 

The  minimum  value  of  the  cost-function,  with  controller  of  restricted  structure,  is  given  as: 

•I nun  =  z.  r  £  {TfTi  +  T1  Tl  +0  0}ds 

2nj  d 


1  x  (rr+rp+*  '(Y0-F0Df)(Y0-F0Df) 

Jmin  -  - — :  f  7I  + -  *  _  „.* - +  <Y>0}ds 

2 nj  D  DcDcDfDf 


(4.24) 


Proof:  The  implied  equation  (4.23)  follows  from  (4.7)  and  (4.8)  by  multiplying  (4.7)  by  BAr  and  (4.8)  by 
AA  ,  and  adding. 


4.1  DESIGN  AND  ROBUSTNESS  IMPROVEMENT 

The  LQG  controller  should  be  designed  in  such  a  way  that  it  is  consistent  with  the  restricted  controller 
structure  of  interest.  For  example,  A  should  approximate  a  differentiator  if  near  integral  action  is 
required.  In  fact  the  assumption  made  in  deriving  Theorem  3.1  was  that  the  controller  structure  is 
compatible  with  the  choice  of  error  weighting  and  if  1/Aq  includes  a  j  axis  pole  then  this  will  be  included 
in  the  chosen  controller.  In  fact  the  usual  situation  will  be  that  the  designer  decides  the  controller  should 
include  integral  action  and  the  weighting  (1/A  )  will  be  chosen  as  an  integrator.  The  control  weighting 
1/Ar  is  not  so  critical  but  if  for  example,  a  PID  structure  is  to  be  used,  then  the  point  at  which  the 
differential  (lead  term)  comes  in  can  help  to  determine  the  A r  weighting.  Clearly,  there  is  no  point  in 
designing  an  LQG  controller  which  has  an  ideal  response,  in  some  sense,  but  cannot  be  approximated  by 
the  chosen  controller  structure.  Thus,  the  weightings  should  be  selected  so  that  the  closed-loop  properties 
are  satisfactory  but  taking  into  consideration  the  limitations  of  the  controller  structure  required.  The  basic 
concept  proposed  is  straightforward.  That  is,  in  the  region  of  the  unity  gain  crossover  frequency  for  the 
open  loop  system,  or  the  phase  margin  frequency,  the  distance  \1+WCq\  should  normally  be  maximised. 
This  requires  the  sensitivity  to  be  minimized,  particularly  in  this  sensitive  region.  By  costing  the 
sensitivity  directly  a  mechanism  is  provided  to  improve  robustness  (Horowitz  1979  [35])  but  there  are 
some  subtleties  to  address: 

•  The  weighting  Pc  needs  to  be  increased  from  zero  where  performance  is  presumably  maximized  (the 
LQG  cost  is  optimized)  up  to  a  level  where  robustness  is  adequate  and  performance  still  acceptable. 

•  If  pure  sensitivity  costing  is  required  Pc  could  cancel  the  combined  noise  dynamics  ®  /;  =  Y^Vj  which 
is  unrealistic.  The  alternative  is  to  make  the  model  W  =  A"  lCp  large,  relative  to  the  other  noise 
terms  and  also  a  constant  ( Cp  =  pA  say)  and  this  will  introduce  a  fictitious  stochastic  term  affecting 
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the  noise  and  disturbance  rejection,  and  reference  tracking,  properties.  The  size  of  the  scalar  p  also 
therefore  involves  a  compromise. 

•  The  weighting  and  frequency  shaping  effects  are  only  important  in  the  decade  above  and  a  little  below 
the  crossover  frequency  referred  to.  The  shaping  might  therefore  be  introduced  by  a  weighting  that 
approximates  an  ideal  window  function  but  this  increases  the  order  of  the  weighting  term. 

The  above  design  choices  and  trade-offs  detract  from  the  approach  but  the  prize  is  quite  important  and 
worthy  of  the  effort.  That  is,  the  provision  of  a  tuning  variable,  or  variables,  in  a  cost  index  where  the 
robustness  properties  of  an  optimal  controller  can  be  manipulated  and  traded  against 
performance/stochastic  properties. 


4.2  APPLICATIONS  OF  RS  TECHNIQUE  FOR  MIMO  SYSTEMS 

Although  the  theory  and  the  derivation  of  the  control  law  presented  in  the  previous  sections  assumed  for 
simplicity  a  single-input,  single -output  system,  similar  results  can  also  be  obtained  in  the  multivariable 
case.  The  restricted-structure  controller  design  problem  then  becomes  particularly  interesting  since  it  can 
be  considered  from  a  number  of  viewpoints. 

First  and  most  obvious  application  is  optimal  tuning  /  benchmarking  of  the  existing  controller.  In  the  case 
of  an  rxm  multivariable  plant,  the  controller  transfer-function  matrix  is  of  size  mxr,  however  only  a  few 
possible  feedback  connections  between  outputs  and  controls  are  probably  used.  In  fact,  the  most  common 
situation  is  a  square  system  with  as  many  inputs  as  outputs,  controlled  by  a  multi-loop  (decentralized) 
controller  -  which  is  often  a  set  of  S1SO  PID  or  PI  controllers.  By  specifying  the  required  control  structure 
to  be  the  actual  structure,  it  is  possible  to  obtain  “optimal”  parameters  for  the  existing  controller,  in  terms 
of  the  given  cost  function. 

Another  use  of  the  technique  may  be  in  I/O  pairing.  For  example,  in  a  3x3  MIMO  system,  the  following 
multi-loop  PID  control  configurations  are  possible: 


PID 

PID 

PID 

A) 

PID 

B) 

PID 

C) 

PID 

PID 

PID 

PID 

PID 

PID 

PID 

D) 

PID 

E ) 

PID 

F ) 

PID 

PID 

PID 

PID 

The  problem  is  the  “best”  choice  of  input/output  pairs  for  a  multi-loop  control  system.  Methods  such  as 
Relative  Gain  Array  are  widely  used  in  industry,  however  they  take  into  account  only  steady-state 
information.  On  the  other  hand,  pre-calculating  the  values  of  the  cost  function  for  all  possible  multi-loop 
configurations  will  determine  the  best  “dynamic”  pairing. 

Finally,  by  computing  benchmark  figures  corresponding  to  all  possible  controller  configurations,  with  off- 
diagonal  elements,  it  is  possible  to  determine  the  potential  benefits  resulting  from  the  introduction  of 
additional  loops  to  the  system.  If  this  turns  out  to  be  greater  than  the  cost  of  installation,  wiring, 
maintenance  etc.,  the  optimal  tuning  parameters  are  then  available. 


5.0  ROBUST  CONTROL  DESIGN  EXAMPLE 

To  illustrate  the  effect  of  the  robustness  weighting  element  and  the  fictitious  robustness  signal  { p(t )}  a 
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simple  example  is  considered.  The  system  models  may  be  listed  as: 

Coprime  system  model: 

w=Bi )= _ 1000(5 +  2)(5  + 6) _ 

Aq  0  +  0.7)(s  +  3 ,9)(s  + 1 00)(52  +  2  %q>qs  +  <o\ ) 

where  £,  =  0.1  and  a>0- 10  .  Also  write 


B  BM , 


W  =  -  =  - 


o'-'o 


Disturbance  model: 


A  AqU0 


W,=^  =  - 


where  Uq  =  ( s+3.2)s 


1000 


A  (s  +  100)(s  +  2^cd0s  +  o}q  )(s  +  3.2)5 


where,  Cd  =  1000(5  +  0.7)(s  +  3.9) 

Reference  model : 


W=-  = 


1 


where 


Noise  model: 


A  (s2  +  2%co0s  +  co q  )s 
E  =  (s  +  0.7)(s  +  3.9)(s  +  3.2)(j  +  100) 


0'‘ 


n  A  (s  +  100) 


where  Cn  =0.1(s  +  0.7)(5  +  3.9)(52  +  2%oo0s  +  a>l)(s  +  3.2)s 


Fictitious  robustness  signal: 


W„  =■ 


c„ 


p 


A  (5  +  100) 


Cost  Function  Weightings 

The  cost  function  weightings  may  be  defined  as:  Qc  = 


(O.Ob +  l)(-0.0l5  +  l) 

(5  +  10“6)(-5  +  10-6) 


Rc  =(105  + 1)( —  1 05  + 1)  and  Pc  =  px 


(0.015  +  l)(-0.0l5  +  l) 
(5  +  10“6)(-5  +  10-6) 


Results 

The  frequency  responses  of  the  different  system  models  are  shown  in  Fig.  2.  The  system  is  low  pass  with  a 
resonant  subsystem  and  the  disturbance  model  includes  an  integrator.  The  measurement  noise  model  only 
rolls  off  at  high  frequencies.  Consider  first  the  full  order  optimal  case  and  the  use  of  a  large  p  - 1 000 , 

then  as  px  varies  the  unit  step  responses  of  the  closed  loop  system  are  as  shown  in  Fig.  3.  This  represents 
the  case  where  there  is  a  large  fictitious  disturbance  model  Wp  but  where  the  robustness  weighting  p 
varies  between  0  to  1000.  The  faster  responses  occur  as  p  increases,  since  the  effect  is  related  to  that 
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when  Qc  increases.  The  corresponding  closed-loop  frequency  responses  are  shown  in  Fig.  3. 


Figure  2:  Bode  Frequency  Responses  for  the  System,  Disturbance  and  Noise 


Figure  3:  Closed  Loop  System  Unit  Step  Responses  for  Fixed  p  =  1000  and  Varying  Weighting  pi 
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Figure  4:  Bode  Diagram  of  Closed  Loop  Responses  for  p  =  1000  and  Changing  Value  of  pi 
Restricted  structure  controller 

The  unit  step  responses  are  compared  in  Fig.  5  for  the  full-order  and  restricted  controller  designs.  The  case 
p-  0  and  /?,  =  1 0  was  considered  and  the  corresponding  controller  and  sensitivity-function  frequency 
responses  are  shown  in  Figs  6  and  7.  The  restricted  structure  control  is  particularly  good  from  an 
overshoot  perspective.  Flowever,  one  reason  is  the  higher  controller  gains  at  high  frequencies  for  the 
restricted  structure  control  law.  The  computed  controllers  were  obtained  as: 

Optimal  Full-Order  Controller: 

^  ^  _  0.493426(s  +  0.2205695)(s  +  0.7)(s  +  3.19963 l)(s  +  3.9)(52  +  1.9999395  +  99.99969)(5  +  100) 

(5  +  0.00000 1)(5  +  1.09342  x  10“6)(5  + 1.008371)(5  +  3.866217)(52  + 14.817935  +  65.05414) 

x(52  + 7.52086b +  166.0202) 


Optimal  Restricted  Structure: 


c0(s)  = 


-0.064124852  -  0.35876215  +  1.208584 
5(0.25  + 1) 


The  step  responses  shown  in  Fig.  8  are  for  the  case  p-px-  100 .  The  results  are  much  faster  and  the 
restricted  structure  design  is  again  good,  relative  to  the  full-order  solution. 
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Tine 


Figure  5:  Comparison  of  Closed-Loop  Unit  Step  Responses  of  Full  and  RS  Control  Designs 


Figure  6:  Bode  Comparison  of  Controller  Frequency  Responses 
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Figure  7:  Bode  Comparison  of  Sensitivity  Function  Frequency  Responses 


Figure  8:  Comparison  of  Unit  Step  Responses  for  Full-Order  and  RS  Control  Designs 


Example  Conclusions 

The  example  reveals  that  the  robustness  weighting  terms  and  the  fictitious  robust  costing  signal  {p(t)} 
certainly  affects  the  overshoots  which  represent  a  measure  of  robustness,  both  on  closed-loop  frequency 
and  time  responses.  The  tuning  variables  p  and  px  affect  the  robustness  of  this  minimum-phase  open 
loop  stable  system  in  much  the  way  expected.  However,  the  alteration  of  robustness  properties  is  not  a 
straightforward  matter,  since  any  values  of  p  and  px  above  zero  will  cause  a  measure  of  sub-optimality 
in  stochastic  (LQG  cost)  terms.  The  most  surprising  results  were  the  very  good  results  obtained  for  the 
restricted  structure  control  designs.  The  explanation  was  the  higher  high  frequency  gains  employed  that 
reduced  the  peaks  on  the  sensitivity  function  frequency  responses.  In  this  problem  changes  in  the 
measurement  noise  model  again  did  not  have  a  large  effect.  However,  results  that  were  not  shown  were 
obtained  for  a  coloured  measurement  noise  model  with  a  peak  in  the  low  frequency  range.  In  this  case  the 
controller  gains  are  significantly  reduced  and  this  slows  the  speed  of  response  of  the  system  and  the 
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overshoot  increases  markedly. 


6.0  NONLINEAR  GENERALIZED  MINIMUM  VARIANCE  CONTROL 

The  system  description  is  of  restricted  generality  and  is  carefully  chosen  so  that  simple  results  are 
obtained.  The  plant  itself  is  nonlinear  and  may  be  time-varying  and  have  quite  a  general  form.  However, 
the  reference  and  disturbance  signals  are  assumed  to  have  linear  time-invariant  model  representations. 
This  is  not  very  restrictive,  since  in  many  applications  the  models  for  the  disturbance  and  reference  signals 
are  only  LTI  approximations. 

The  system  is  shown  in  Fig.  9  and  includes  the  nonlinear  plant  model  and  the  linear  reference/disturbance 
models.  There  is  no  loss  of  generality  in  assuming  that  the  zero  mean  white  noise  sources  and 

{^(0}  have  identity  covariance  matrices  (Atherton  1982  [36]).  There  is  no  requirement  to  specify  the 
distribution  of  the  noise  sources,  since  it  will  be  shown  that  the  special  structure  of  the  system  leads  to  a 
prediction  equation,  which  is  dependent  upon  the  linear  disturbance  and  reference  models. 

6.1  System  Models  and  Signals 

The  polynomial  matrix  system  models,  for  the  ( rxm )  multivariable  system,  shown  in  Fig.  9,  may  now 
be  introduced.  Part  of  the  system  is  represented  by  linear  and  part  by  nonlinear  models.  The  linear 
disturbance,  reference  and  plant  output  subsystem  models  have  the  left-coprime  polynomial  matrix 
representation: 


YWd0{z~l),  Wr(z~l),  W0k(z~l)\  =  A-^z-^CJz-1),  Er(z-\  B0k(z~l)\  (5.1) 

The  polynomial  matrix  system  models,  for  the  system,  may  be  listed  as  follows: 

Disturbance  model:  Wd(z~1)=  A~1(z~1)Cd(z~1) 

Reference  model:  Wr(z~l)  =  A~1(z~1)Er(z~l)  (5.2) 

Without  loss  of  generality  these  models  have  the  common  denominator  polynomial  matrix  A(z~1).  Note 
that  the  arguments  of  the  polynomial  matrices  are  often  omitted  for  simplicity.  The  subsystem  associated 
with  the  plant  inputs  is  assumed  to  be  unstructured  (need  not  have  known  equations)  and  of  the  form: 

Nonlinear  time-varying  plant  model:  (Y\hi)(i)  -  Dk  ( Wku){t )  (5.3) 

where  Dk  denotes  a  diagonal  matrix:  Dk  —  diag  |z"k‘  ,z~kl  ,...,z'kr  j  of  the  common  delay  elements  in  the 
respective  output  signal  paths. 

One  of  the  main  strengths  of  the  method  is  that  no  model  is  required  for  the  nonlinear 
subsystem:  (VV{-«)(t) .  It  is  necessary  to  assume  this  is  stable  and  to  have  some  means  of  computing  the 

output  from  this  block  but  a  traditional  equation  based  model  is  not  essential.  That  is,  look-up  tables  may 
be  employed,  old  Fortran  code  may  be  available  that  enables  the  output  to  be  computed  for  a  given  input, 
or  as  in  current  research,  a  fuzzy  neural  model,  may  be  fitted  to  real  plant  data.  These  methods,  which  do 
not  involve  a  conventional  model,  can  provide  all  that  is  needed  to  compute  the  control  law.  Most  of  the 
results  do  not  need  a  more  detailed  breakdown  of  the  plant  model  structure.  However,  in  the  later 
sections,  to  show  the  system  can  be  stabilised,  it  will  be  assumed  that  any  unstable  modes  of  the  plant  are 
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included  in  a  stable/unstable  linear  time  invariant  block  of  polynomial  matrix  form:  W0k  =  A~x  B0k  .  Thus, 
the  delay  free  plant  model  term (Wku  )(t)  =  W0k  (Yv\k u)(t)  and  hence  the  total  plant  model  may  be  written 
as: 


[Wu)(t)  =  DkW,k(W,ku){t)  (5.4) 


The  signals  shown  in  the  system  model  of  Fig.  9  may  be  listed  as  follows: 


Error  signal: 

e{t)  =  r{t)-y(t) 

(5.5) 

Plant  output: 

y(t')  =  d(t')  +  (PPu)(t') 

(5.6) 

Reference: 

II 

(5.7) 

Disturbance  signal: 

d(t)  =  Wdf(t) 

(5.8) 

Combined  signal: 

^3 

1 

II 

(5.9) 

The  power  spectrum  for  the  combined  reference  and  disturbance  model  can  be  computed,  noting  these  are 
linear  subsystems,  using: 

0ff  =Orr+Odd  =  WrW;  +  WdW*d  (5.10) 
and  the  generalized  spectral-factor  Yf  may  be  computed  using: 

YfYf=0ff  ((5-1 1) 

where  the  system  models  ensure  Yf  is  strictly  minimum  phase.  Note  that  a  measurement  noise  model  has 
not  been  included  to  simplify  the  equations.  This  is  appropriate  so  long  as  the  control  cost-function 
weighting,  introduced  in  the  next  section,  ensures  controller  roll-off  at  high  frequencies. 

6.2  Optimal  Nonlinear  Generalized  Minimum  Variance  (NGMV)  Problem 

The  optimal  NGMV  control  problem  involves  the  minimisation  of  the  variance  of  the  signal 
I  </y(t) }  in  Fig.  9.  This  signal  involves  a  ( rxm )  dynamic  cost  function  weighting  matrix:  Pc(z~x)  on  the 

error  signal,  represented  by  linear  polynomial  matrices  as:  Pc  =  Pad  Pen-  It  also  includes  an  m-square, 
nonlinear  dynamic  control  signal  costing  operator  term:  (jrcu)j(t'j.  The  choice  of  dynamic  weightings  is 
critical  to  the  design  and  typically  Pc  is  low-pass  and  is  a  high-pass  transfer.  The  signal: 

</>o(t)  =  Pce{t)  +  {Jrcu){t)  (5.12) 

is  to  be  minimized  in  a  variance  sense,  so  that  the  cost  index  to  be  minimised: 

J  =  e[</>o  (i)4(t)}  =  £’{trace{4(?)4r  (f)}}  (5.13) 
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where  is{-}  denotes  the  expectation  operator.  Note  that  in  some  applications  the  signal  tp{)  (/)  may 

represent  an  inferred  output.  That  is,  this  signal  can  represent  the  output  from  a  subsystem  that  cannot  be 
measured  directly. 


h  =  pce  + 


Disturbance 

model 


Figure  9:  Single  Degree  of  Freedom  Closed  Loop  Feedback  Control  System  for  the  Nonlinear 
Plant  (signal  <j>o  is  dependent  on  the  weightings  shown  dotted) 

If  the  smallest  of  the  delays  in  each  output  channel  of  the  plant  are  of  magnitudes:  {kx,k2,...,ky} , 
respectively,  this  implies  the  control  at  time  t  affects  the  jth  output  at  least  k-  steps  later.  For  this  reason 
the  control  signal  costing  can  be  defined  to  have  the  form: 

(^cu)(t)  =  Dk(^cku)(t)  (5.14) 

Typically  this  will  be  a  linear  operator  but  it  may  also  be  chosen  to  be  nonlinear  to  cancel  the  plant  input 
nonlinearities  in  appropriate  cases.  The  control  weighting  operator  kFck  is  assumed  to  be  full  rank  and 
invertible. 

Theorem  6.1:  NGMV  Optimal  Controller 

The  NGMV  optimal  controller  to  minimize  the  variance  of  the  weighted  error  and  control  signals 
may  be  computed  from  the  following  equations.  The  assumption  is  made  that  the  nonlinear  possibly  time- 
varying  operator  (PckVk  -  yrck )  has  a  stable  causal  inverse,  due  to  the  choice  of  weighting  operators  Pc 
and  kFc  .  The  smallest  degree  solution  (Go, Fq),  with  respect  to F0,  must  be  computed  from  the  polynomial 
matrix  equation:  ApPcdF0  +  Dk  G0=PcfDf  (5.15) 

where  the  left  coprime  polynomial  matrices  Apj  and  Pt/  satisfy:  A~  1  P(.f  =  Pcn  A  1  (5.16) 
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and  the  spectral  factor  Yj  is  written  in  the  polynomial  matrix  form:  Yj-  =  A  1  Dj . 

Optimal  control  signal:  The  optimal  NGMV  control  action  can  be  computed  as: 

u{t)  =  (F0Y/Hi-^k  )_1  [(^ApPcd  )_1  G0Tf_1e)(t)  (5.17) 

Proof:  The  proof  involves  collecting  results  in  the  next  section.  ■ 

Remarks: 

•  The  following  solution  is  simplified  if  Dk  and  the  weighting  Pc  and  spectral  factor  Y f  commute. 

This  assumption  is  certainly  valid  if  the  delay  elements  are  the  same  in  each  channel  Dk  =  z~k  I  or  if 
Pc  and  Y f  are  diagonal  transfers,  which  is  also  reasonable  for  many  applications. 

•  The  class  of  problems  considered  are  those  for  which  a  solution  to  the  Diophantine  equation  can  be 
found  where  the  row  degrees  of  F0(z_1)  are  less  than  the  delay  path  magnitudes  {k1,k2,...,kr}  and 
this  is  ensured  under  the  conditions  listed  in  the  previous  remark. 

•  A  further  consequence  of  the  above  assumption  is  that  the  matrices:  F0 Yj 1 ,  G0  Yj 1  and  Dk  commute, 
which  is  a  property  employed  later  in  the  proof. 

6.3  Solution  of  the  Nonlinear  Optimal  Control  Problem 

A  simple  optimisation  argument  is  used  in  the  following.  The  signal  to  be  minimised  is  shown  to 
consist  of  both  linear  and  nonlinear  terms.  However,  the  stochastic  part  of  the  problem  involves  linear 
models  so  that  a  prediction  equation  may  easily  be  derived.  This  enables  the  signal  to  be  written  in  terms 
of  future  and  past  white  noise  related  terms.  The  optimal  causal  solution  is  therefore  that  which  sets  the 
past  terms  to  zero.  This  will  include  some  of  the  nonlinear  control  input  dependent  terms  and  the  optimal 
control  follows. 

Consider  the  minimisation  of  the  signal  j^0(?)},  where  (?)  represents  the  weighted  sum  of 
error  and  control  signals  and  is  the  same  dimension  as  the  input  signal.  This  fictitious  or  inferred  output  is 
defined  as:  ^0(f)  =  i).e(t)  +  (^,M)(f)  ,  where  Pc  is  assumed  to  be  a  linear  and  Jrc  can  be  a  linear  or 
nonlinear  operator.  Now  from  the  equations  in  §6.1:  e-r-y-r-d-Wu  and  hence, 

4  (?)  =  pc  (r  -  d  - >Vu)  +  JPcu  =  Pc(r  -d)- (PckV-^c)u  (5.18) 

Notice  this  last  term  involves  non-linear  operators  and  the  notation  implies  that  the  signal: 

{PcW-Src)«  =  Pc{Wu){t)-{S-cu){t) 

Assumption:  An  important  assumption  will  now  be  recalled  that  does  not  affect  stability  properties 

but  may  cause  a  degree  of  sub-optimality  in  disturbance  rejection.  That  is,  the  model  for  the  signal 
f  -  r-d  is  assumed  to  be  linear.  In  fact,  disturbance  models  are  often  determined  by  nonlinear  power 
spectrum  models  but  are  approximated  well  by  a  linear  system  driven  by  white  noise.  A  typical  example 
arises  in  ship  positioning  applications  where  the  Pierson-Moskowitz  spectrum  is  used  to  model  wave 
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motion  but  where  this  is  normally  approximated  by  a  lightly  damped  second-order  linear  system. 

Spectral  Factor:  Recall  the  signal:  /  =  Yfz  ,  where  Yf  is  a  linear  transfer  and  rft'j  denotes  a  zero 

mean  white  noise  signal  of  identity  covariance  matrix.  The  operator  Yf  follows  from  a  standard  spectral- 
factor  computation,  given  the  disturbance  Wd  and  the  reference  Wr  signal  models  (Grimble,  1994  [18]). 
The  Yf  may  be  assumed  to  have  the  following  polynomial  matrix  form:  Yf  =  A~lDf ■  where  from  the 
system  description  Df  is  strictly  Schur.  Thence,  from  the  first  term  in  (5.18): 

Pc{r~d)  =  PJ  =  PcYfe  =  P^PcnA-'DfE 

Now  introduce  the  left  coprime  polynomial  matrices  A  and  Pcf  satisfying:  PcnA-l=A-lPcf 


then  Pcf  =  [ApPcd ) 
form: 


PcfDf£  and  from  equation  (5.15)  the  weighted  error  and  control  signals  have  the 


<k  =  {ApPcdf  PcfDfz-{PcW-^c)i 


(5.19) 


6.3.1  Diophantine  equation 

Introduce  the  following  linear  Diophantine  equation,  to  expand  the  combined  disturbance  and 
reference  model  into  two  groups  of  terms:  ApPcdF0  +  DkG0  =  PcfDf  (5-20) 

where  the  solution  for  ( F0 , G0 )  satisfies  the  row  j  degree  of  F0  <  kj.  Hence, 

PcYf=(ApPcd)XPcfDf  =F,  +  (ApPcd)~X  DkG0  (5.21) 

The  first  polynomial  matrix  includes  delay  elements  in  the  jth  channel,  up  to  and  including  z  kj+l  and  the 
last  term  involves  delay  elements  greater  than  or  equal  to  k:  in  each  output  channel.  Substituting  into 

(5.19),  obtain  the  inferred  output  signal: 

</>o=F0  8+  (ApPcd}lDkG0£-(PckV-S-c)u  (5.22) 

but  8  =  Yf */  =  Yf1  ( e  +  yPu )  =  Yfl(r  -  d)  and  hence  substituting  in  (5.22): 

</>o=F0 8  +  (^ApPcd )  *  DkG0Yfle  -  ^-(^ApPcd )  '  DkG0YflFP  +  PCPP  -Jfju 

=  P0z  +  (APPcd )"'  DkG0Yf~le  +  )_1  (d,G0  -  ApPcdPcYf  )  Yf'W  +  Jfju  (5.23) 

but  ApPcdPcYf  —  APPcn  A~X  Df  =PcfDf  and  hence  (5.21)  gives:  DkG0-PcfDf  =-ApPcdF0 
These  last  two  equations  then  give  the  desired  weighted  error  and  control  signal  as: 


2-24 


RTO-EN-SCI-166 


Advanced  Control  Law  Tuning  and  Performance  Assessment 


•k  =F0z  +  (ApPcd}1  DkG0Y^e  +  (s-c-F0Y;lw)u  (5.24) 

The  control  signal  at  time  t  affects  the  jth  system  output  at  time  t+kj  and  hence  the  control  signal  costing 
term  YFC  should  include  a  delay  of  kj  steps,  so  that  JFC  =DkJrck.  Moreover,  since  in  general  the  control 

signal  costing  is  required  on  each  signal  channel,  the  Jrck  weighting  may  be  defined  to  be  of  full  rank  and 
invertible.  Equation  (5.24)  may  be  simplified  further  if  ApPcd  and  Dk  and  F0Yf- 1  and  Dk  commute, 
which  is  certainly  the  case  under  the  assumptions  on  Pc  and  Y j  discussed  after  the  Theorem  at  the  end  of 
the  last  section.  From  (5.24)  the  inferred  or  fictitious  output  may  be  written  as: 

4(f)  =  F0s(0  +  A-((^(0-W(^u)(0  +  (^rlGo^le(0)  (5.25) 

6.3.2  Optimisation 

To  compute  the  optimal  control  signal  inspect  the  form  of  the  weighted  error  and  control  signals  in 
equation  (5.25).  Since  the  row  degrees  of  F{]  are  required  to  be  less  than  k-  (the  magnitude  of  the  delay  in 

the  jth  channel)  the  jth  row  of  the  first  term  is  dependent  upon  the  values  of  the  white  noise  signal 
components:  e(f)  ,...,e\t-kj  +1) .  The  remaining  terms  in  the  expression  for  the  jth  row  are  all  delayed 

by  at  least  kj  steps  and  therefore  depend  upon  the  earlier  values:  &(t-k) ,  £  (t  -  kj  - 1 ) , . . .  .it  follows  that 

the  first  and  the  remaining  terms  are  statistically  independent.  The  first  term  on  the  right  of  (5.25)  is 
independent  of  the  control  action  and  the  smallest  variance  is  achieved  when  the  remaining  terms  are  set  to 
zero.  The  optimal  control  signal  must  therefore  satisfy: 

u(t)  =  [f,YJx  -  (ApPcd  )"'  GoT/1e(0)  (5.26) 

and  this  may  be  represented  in  the  block  diagram  of  Fig.  10. 

Recall  the  signal  <f>0  =  Pce  +  YFcu  =Pc(r-  v)  +  YFcu  involves  a  weighting  YFC  that  normally  has  a  negative 
sign  to  ensure  (j){)  is  minimized  by  a  signal  u  having  negative  feedback.  The  forward  path  gain  of  the 
controller  block  is  therefore  usually  positive. 
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Disturbance 


Figure  10:  Control  Signal  Generation  and  Controller  Modules 


6.3.3  Alternative  Expression  for  the  Control  Signal 

An  alternative  expression  for  the  control  signal  may  be  found  that  is  useful  for  stability  analysis. 
From  equation  (5.26),  recalling  that  F0 Yj 1  and  Dk ,  are  assumed  to  commute: 


«(0  =  ■ ^  - (ApPcd  )_1  GoF/1  (r(0 d  {t) - (^)(0)) 


Note  from  the  Diophantine  equation  (5.20)  the  first  term: 

FJf~X  +  (Ap  Pcd  )"'  DkGJfl  =  (ApPcd )~‘  (ApPcd  F0  +  DkG0 )  Y/1  =  (. ApPcd  )' 
Thence,  the  expression  for  the  control  signal  simplifies  as: 

orsince:  (*^t  ~  pMk)u  =  ~(ApPcd)  '  G0Yf~l  (r(t)~  d  (t)) 


pcfA  =p;Jpct 


u{t)  =  (PM  -  M  )_1  (ApPcd  )_1  G0Yf~l  (r{t)  -  d  (*))  (5.27) 

where  the  existence  of  a  stable  causal  inverse  of  the  nonlinear  operator  (PcFFk  -  JFck  j  is  assumed.  This 
latter  result  may  also  be  confirmed  by  applying  the  optimisation  argument  directly  to  (5.24). 

Note  that  the  nonlinear  operator  may  be  computed,  assuming  the  existence  of  the  inverse  of  the 
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control  weighting  dFck .  For  example,  if  the  signal  y/{t)  is  defined  as: 

¥{t)  =  (PcHi-^ck  )m  =  Pc(^«)(0-(^«)(0 

then  the  control  signal:  u[t)  =  J^k~l  j 

but  this  cannot  be  used  for  computations,  since  the  right-hand  side  also  includes  u  at  time  t. 

To  avoid  this  problem  in  the  computation  of  u{t)  the  operator  ( Pc}%  -  A\.k  )  must  be  split  into 
two  parts  involving  a  term  without  a  delay  J\f0  and  a  term  that  depends  upon  past  values  of  the  control 
action  ^  .  That  is,  y/(t)  =  (PcyPk  -  J^k)u  =  (A/r0u)(t)  +  (A/rlu)(t)  (5.28) 


and  the  control  signal  can  then  be  computed  as: 

u(t)  =  Af0-\y,{t)-Wu)(t))  (5.29) 


Since  the  weightings  may  be  chosen  freely  the  pulse  response  operator  Af0  can  be  assumed  to  be  full  rank 
and  the  inverse  exists.  This  may  be  computed  from  . A / u  =  ( PCM{  -  jFck  )u,  if  a  model  is  available,  by 
setting  z~l  -  0,  so  that:  AA0  -  A/”|z-1=0=  {PC^K  ~^hk  )|  -i  0  and  JV{  =  Af  -  A/^ .  The  results  suggest 
the  method  of  implementing  the  inverse  operator  shown  in  Fig.  11. 


Plant  and  weighting  operators 


Figure  11:  Nonlinear  Time-Varying  Operator  ( PcPl{.  —  dFck  )  and  its  Inverse 

Limiting  Case:  Note  that  in  the  limiting  case,  for  a  square  system,  when  JP;k  — >  0  the  optimal 

control  signal  (5.27)  becomes:  ^minvar  (0  =  {^c^i )  '  [ApPcd  )  G0Ty:1(r(t)-f/(t))  and  clearly  the 

minimum  variance  control  for  the  nonlinear  system  includes  the  stable  inverse  of  the  plant  model,  when 
one  exists.  The  nonlinear  terms  are  cancelled  in  this  somewhat  unrealistic  case.  In  practice  control 
costing  must  be  employed  to  ensure  unrealistic  high  gains  do  not  occur,  and  hence  pure  minimum  variance 
control  is  not  normally  an  option.  Flowever,  the  link  to  this  well  known  controller  is  valuable  to  provide 
confidence  in  the  solution. 
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6.3.4  Existence  of  the  Stable  Nonlinear  Operator  Inverse 

The  above  result  (5.27)  indicates  a  necessary  condition  for  optimality  is  that  the  operator 
(PcPVk  ~  •'dt  )  must  have  a  stable  inverse.  For  the  case  of  linear  systems  the  requirement  is  that  the 

operator  is  strictly  minimum-phase.  This  reveals  that  one  of  the  restrictions  on  the  choice  of  cost 
weightings  is  that  this  stability  condition  be  fulfilled. 

The  simple  solution  obtained  to  this  problem  depends  upon  the  existence  of  the  stable  inverse  of 
the  operator  ( PcFFk  -  dFck  )•  An  important  question  is  whether  sensible  choices  of  the  weightings  will  lead 
to  this  condition.  To  show  this  covers  a  very  wide  class  of  systems  consider  the  case  where  J\.k  is  linear 
and  negative  so  that:  jFck  =  -Fk .  Then  obtain:  ( PCM{  +Fk)u-Fk  yFkxPcFVk  +  I )  u  and  note  that  the  term 

(  /  +  Ffl  Pppj.  j  represents  the  return-difference  operator  for  a  system  with  feedback  controller 

Kc  -  Fk  1  Pc .  This  interesting  equivalence  is  important,  since  the  stability  of  the  inverse  operator  is  clearly 
related  to  the  stability  of  the  feedback  loop.  It  will  be  shown  later  that  this  feedback  loop  actually  arises  in 
a  nonlinear  version  of  the  Smith  Predictor. 

Starting  Point  for  Weighting  Selection:  Consider  the  delay  free  plant  Mj.  and  assume  a  PID 

controller  exists  Kc  to  stabilize  the  closed  loop  system.  Then  a  starting  point  for  weighting  choice,  that 
will  ensure  ( PCH{  +  Fk )  is  stably  invertible,  is  Ff 1  Pc  -  Kc.  That  is,  when  the  optimal  controller  is  to  be 

applied  to  an  existing  system  it  is  very  likely  that  a  PID  controller  that  stabilizes  the  plant  is  available,  and 
will  therefore  provide  a  starting  point  for  weighting  selection.  To  demonstrate  that  such  a  weighting 
selection 

is  reasonable  consider  the  scalar  case  and  let  a  controller^  have  the  form: 

*c=*o  +  ^_i  +k2(l-z~l)  =  ((ko  +kl+k2)-(k0  +  2k2)z~l +k2z~2y(\-z~1')  (5.30) 

and  assume  the  PID  gains  are  positive  numbers,  with  small  derivative  gain.  Then  it  is  simple  to  confirm 
that  if  Fk  - 1  the  Pcn  term  is  minimum  phase  and  has  real  zeros.  The  Pc  =  PfJ  Pcn  term  then  includes 
integral  action  at  low  frequency  and  a  lead  term  at  high  frequency,  which  implies  the  resulting  optimal 
controller  should  have  similar  characteristics  to  the  PID  controller  involving  high  gain  at  both  low  and 
high  frequencies.  This  suggests  that  some  modification  will  be  necessary  to  limit  the  gain  at  high 
frequencies.  Using  the  equivalence  to  a  filtered  PID  controller  leads  to  weightings  of  similar 
characteristics  but  with  a  limited  gain  at  high  frequencies.  Recall  the  above  argument  was  aimed  at 
showing  there  are  likely  to  be  weighting  choices  that  are  reasonable  and  lead  to  a  stable  inverse  for  the 

operator  [PcPPk  -  JFck  j,  under  the  above  very  reasonable  assumption. 

Realising  the  Controller  and  Minimum  Cost 

The  controller  expression  may  also  be  expressed  using  the  inverse  of  the  nonlinear  operator  (from  (5.26)) 
as:  u(t)  =  (F0Y/pyk-^k  )"' ((4^  )"' G  0F/-'e)(t)  (5.31) 

The  controller  has  the  structure  shown  in  Fig.  12,  or  equivalently,  expanding  the  nonlinear  compensator 
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block,  it  corresponds  with  the  controller  structure  of  Fig.  10. 


Controller  Subsystem 


Disturbance 


Figure  12:  Equivalent  Single  Degree  of  Freedom  Nonlinear  Controller  Structure 

Minimum  Cost: 

The  minimum  cost  is  clearly  due  to  the  first  (time-invariant  linear)  term  in  (5.24).  That  is,  using  ParsevaTs 
theorem:  Jmin  =  F{(F0s(t))r(F0s(?))J  =  ^—  trace  Jf0(z_1)f0(z-1)  Jy  (5.32) 


It  is  interesting  that  this  expression  for  the  minimum  cost,  which  can  provide  a  benchmark  cost  for 
nonlinear  controller  design,  depends  only  on  the  reference  and  disturbance  signal  models  that  are  LTI. 
This  arises  because  the  control  action  effectively  removes  the  nonlinear  plant  model  from  the  prediction  of 
the  signal  {^(t)},  whose  variance  is  being  minimised. 

Minimum  Variance  and  Special  Cases 

To  verify  the  solution  in  the  special  case  when  the  control  weighting  SFC  tends  to  zero,  a  modification  to 
the  proof  is  required.  From  equation  (5.26)  the  optimal  control  signal  must  satisfy: 

F0Yf~l(Miu  )(t)  =  (ApPcdylG0Yf-'e(t ) 

For  a  square  system,  where  the  inverse  of  PVk  exists,  the  minimum  variance  control,  for  a  nonlinear, 
possibly  time-varying,  process:  «(t)  =  |^._1  ^TyF0"1Fc)/1^4“1G0T):Iej j(t)  (5.33) 

and  if  the  system  is  scalar,  then  for  nonlinear  systems:  u  (t)  -  [h{~1  ^F0“ 

If  now  the  plant  is  assumed  linear  then  the  plant  operator  H{  may  be  written  in  polynomial  form  as: 

Wk  =  A~lBok  and  the  scalar  minimum  variance  control:  u(t )  = - - - e(t)  (5.34) 

BtikCVcd 

This  is  the  stabilizing  control  if  the  process  is  minimum  phase.  This  is  a  reminder  that  the  GMV  control 
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law,  with  control  costing,  is  only  stabilizing  when  the  control  weighting  is  chosen  appropriately.  There 
are  also  many  applications  where  the  inclusion  of  the  inverse  of  the  plant  model  in  the  control  law  would 
suggest  unrealistically  high  gains  and  poor  behaviour  to  noise  and  uncertainties.  However,  this  special 
case  is  relatively  simple  and  there  are  some  applications  like  the  winding  mechanisms  for  coal  mines, 
where  inverse  static  characteristics  are  employed  routinely. 

Relationship  to  the  Smith  Predictor 

The  optimal  controller  can  be  expressed  in  a  similar  form  to  that  of  a  Smith  Predictor.  This  provides  a 
new  nonlinear  version  of  the  Smith  Predictor.  Moreover,  it  provides  an  optimal  method  of  tuning  and 
provides  optimal  stochastic  disturbance  rejection  and  tracking  properties.  However,  the  introduction  of 
this  structure  also  limits  the  application  of  the  solution  on  open-loop  unstable  systems.  That  is,  although 
the  structure  illustrates  a  useful  link  between  the  new  solution  and  the  Smith  time  delay  compensator,  it 
also  has  the  same  disadvantage,  that  it  may  only  be  used  on  open-loop  stable  systems.  Nevertheless  the 
structure  is  intuitively  reasonable  and  should  be  valuable  in  applications.  This  Nonlinear  Smith  Predictor 
will  now  be  derived. 

Observe  that  the  system  in  Fig.  10  may  be  redrawn  as  in  Fig.  13.  The  changes  are  made  to  the  linear 
subsystems  by  adding  and  subtracting  equivalent  terms. 


Compensator 


Figure  13:  Modification  to  the  Controller  Structure  Shown  Dotted 

Now  combine  the  two  linear  inner  loop  blocks,  by  first  defining  the  signal:  mk(t}  =  (M{u)(t)  as 

follows:  [F,Yfl  +  ( ApPcd )"'  =  (, ApPcd )"'  (ApPcdF0  +  DkG0)Yf~lmk 

but  substituting  from  (5.20),  assuming  Dk  and  G0  commute: 

( W*  +  (ApPcd  )~‘  G0Yf~xDk)mk  =  Pcmk  (5.35) 

The  system  may  therefore  be  redrawn  as  shown  in  Fig.  14  where  the  control  action  clearly  satisfies 
equation  (5.27).  Now  observe  that  the  compensator  may  be  rearranged,  as  shown  in  Fig.  15.  This  latter 
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structure  is  essential  if  Pc  includes  an  integrator,  which  introduces  integral  action.  That  is,  PCJ  must  be 
placed  in  the  inner  error  channel,  rather  than  in  individual  blocks  as  in  Fig.  14. 


Disturbance 


Figure  14:  Nonlinear  Smith  Predictor  Compensator  and  Internal  Model  Structure 

The  structure  in  Fig.  14  is  intuitively  reasonable  and  easy  to  explain.  Note  from  the  control  signal  u  to 
the  feedback  signal  p  that  the  transfer  is  null  when  the  model  DkH{  matches  the  plant  model.  It  follows 
that  the  control  action,  due  to  reference  signal  r  changes,  is  not  due  to  feedback  but  involves  the  open- 
loop  stable  compensator  involving  the  block  A~lG0YJl  and  the  inner  nonlinear  feedback  loop.  This  inner- 

loop  has  the  weightings  J\.kPc  acting  like  an  inner-loop  controller.  If  these  weightings  are  chosen  to  be 
of  the  usual  form  it  will  represent  a  filtered  PID  controller.  Thus,  the  control  action  due  to  the  reference 
changes  will  be  due  to  the  cascade  of  these  two  blocks. 

It  should  be  emphasized  that  the  choice  of  the  weightings  to  be  equal  to  a  PID  control  law  is  only  a 
starting  point  for  design,  since  stability  is  easier  to  achieve.  However,  the  control  weighting  can  have  an 
additional  lead  term  (or  alternatively  a  high  frequency  lag  term  may  be  added  to  the  error  Pc  weighting. 
The  high  frequency  characteristics  of  the  optimal  controller  will  then  have  more  realistic  roll  off.  Under 
the  given  assumptions  the  resulting  system  is  stable.  This  follows  because  the  plant  is  stable,  the  inner- 
loop  is  stable  (due  to  choice  of  weightings)  and  there  are  only  stable  terms  in  the  input  block. 
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Compensator 


Disturbance 


+ 


Figure  1 5:  Nonlinear  Smith  Predictor  Compensator  and  Internal  Model  Structure  ( y/  3  =  Pcd  W  ) 

Design  Issues 

In  general,  it  seems  relatively  easy  to  obtain  an  NGMV  design  very  close  (and  normally  better)  than  the 
existing  PID  performance,  and  then  use  the  proposed  parameterization  (which  is  only  one  of  a  number  of 
possible  choices)  to  achieve  further  improvement.  The  approach  is  not  in  competition  with  PID  of  course. 
There  is  every  reason  to  use  the  simplest  possible  controller  that  will  do  the  job.  The  NGMV  was  only 
compared  with  the  PID  design  for  this  low  order  problem.  It  has  the  advantage  that  if  the  plant  is  high 
order,  a  stabilising  PID  control  law  may  not  exist 

As  the  controller  includes  the  nonlinear  model  of  the  plant,  it  should  be  robust  against  any  changes  of  the 
operating  point,  whereas  any  linear  controller  may  have  problems  regulating  across  the  whole  operating 
range.  The  above  results  do,  of  course,  correspond  to  no  (or  little)  plant/model  mismatch.  The  choice  of 
cost  weightings  to  optimise  robustness  will  be  the  subject  of  future  research. 

Some  of  the  other  areas  where  further  investigation  is  warranted,  that  relate  to  design  and  applications, 
may  be  listed  as: 

•  A  slight  generalization  is  to  define  a  completely  nonlinear  objective  function,  so  that  the  error 
weighting  is  nonlinear  Pc . 

•  Constraints  on  input  actuators,  like  mechanical  bending  limits,  can  be  allowed  for  using  barrier 
functions,  which  may  be  absorbed  into  the  plant  model  as  a  further  nonlinearity. 

•  Integral  action  can  be  included  by  defining  the  error  weighting  to  include  an  integrator.  In  this  case 
the  way  in  which  the  optimal  control  deals  with  windup  should  be  analyzed. 

•  The  inclusion  of  different  model  structures/types  for  the  plant  model  W ,  like  a  neural  network,  to 
provide  a  learning  capability. 

•  A  low  order  solution  may  be  obtained  using  a  restricted  structure  control  design  philosophy. 
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7.0  NONLINEAR  GMV  CONTROL  PROBLEM 

The  computation  of  a  NGMV  controller  is  illustrated  below  in  the  design  of  a  scalar  nonlinear  discrete¬ 
time  dynamic  system,  given  in  the  following  nonlinear  state-space  form: 


Xj  ( t  + 1) 


*i(0-*2(0 

1  +  xf  ( t ) 


+  u(t ) 


x2(t  +  l)  =  e-xht)xt(t)  +u(t) 
y(t)  =  x1(t-  4) 


Let  the  initial  state  x(0)  =  0  (the  stable  equilibrium  point  of  the  autonomous  system).  Observe  that  the 
output  y(t)  includes  a  transport  delay  of  k  =  4  samples.  The  open-loop  system  response  to  a  series  of  steps 
is  shown  in  Fig.  16,  and  the  nonlinearity  present  in  the  system  is  clearly  evident  from  the  wide  range  of 
responses. 
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Figure  16:  Open-loop  Plant  Responses  for  Operating  Regions 


For  the  nonlinear  GMV  controller  design,  the  linear  reference  model  has  been  defined  as: 
Wr  =  0.()5/(  1  —  f).  99z  'j,  and  is  the  stochastic  analogue  of  a  near  step  reference  changes.  The  model  of  the 

additive  linear  disturbance  acting  on  the  system  output  was  chosen  as:  Wd  =  0. 05/(1 -0.8z  j.  Assume  the 

plant  is  controlled  by  the  nominal  stabilizing  PID  controller,  denoted  C,  (z  1 ) ,  with  filtered  derivative 


term: 


CX=K 


1  +  - 


i _ , 

m -z-1)  a -rdz-x) 


and  with  the  tuning  parameters:  K=0.1,  T;=4s,  Td=ls  and 


td=0.5.  As  explained  in  section  4.4,  the  nominal  dynamic  weightings  for  the  NGMV  design  may  be 
defined  in  terms  of  this  controller  as:  Pc  =  C] ,  Fc  —  -z-4 .  The  Bode  plots  of  these  weightings  are  shown 
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in  Fig.  17. 


Bode  Diagram 


10‘2  10*1  10°  101 

Frequency  (rad/sec) 

Figure  17:  Frequency  Responses  of  the  Dynamic  Weightings  (nominal  design) 

The  reference  tracking  of  a  sequence  of  steps  for  the  two  nominal  controllers  is  shown  in  Fig.  18,  and  the 
corresponding  output  and  control  signal  variances  are  collected  in  Table  1.  Note  that  the  nominal  PID 
tuning  parameters  have  only  been  found  to  stabilize  the  delay-free  plant  and  are  not  ‘optimized’  in  any 
sense.  However,  this  controller  is  useful  in  that  it  can  provide  initial  design  parameters  for  the  NGMV 
controller  that  will  stabilize  the  plant,  i.e.  make  the  nonlinear  operator  stable  and  invertible.  As  can  be  seen 
from  Fig.  18  and  Table  1,  the  performance  of  the  initial  nonlinear  controller  design  is  close  to  that  of  the 
original  PID,  although  it  is  normally  more  robust  to  the  changes  of  the  operating  point  (this  can  be  seen  for 
the  set-point  equal  to  zero).  The  stochastic  performance  of  the  nonlinear  controller  is  also  slightly  better. 
The  importance  of  this  result  is  not  the  controller  produced  but  that  it  provides  a  painless  way  to  obtain  an 
initial  choice  of  cost  weightings. 
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Figure  18:  Time  Responses:  Nominal  PID  and  NGMV  Controllers 


Table  1:  Stochastic  Performance:  Nominal  PID  and  NGMV  Controllers 


Op.  point 

Controller 

Var[e] 

Var[u] 

Var[phi] 

3 

PID 

0.01662 

0.00117 

0.00082 

NGMV 

0.01672 

0.00104 

0.00082 

0 

PID 

0.01568 

0.00046 

0.00054 

NGMV 

0.01180 

0.00025 

0.00038 

-1 

PID 

0.01122 

0.00168 

0.00051 

NGMV 

0.01117 

0.00166 

0.00052 

-3 

PID 

0.00729 

0.00115 

0.00035 

NGMV 

0.00726 

0.00115 

0.00033 

The  nominal  NGMV  design  will  now  be  modified  by  changing  the  control  weighting.  Parameterize  the 
control  weighting  as:  Fck  =  -p(\-yz~l)  ,  where  p  is  a  positive  scalar  and  y  is  a  value  from  0  to  1,  to 
introduce  a  lead  term  to  the  weighting.  Note  that  a  lead  term  on  the  control  signal  weighting  is  normally 
useful  to  reduce  the  high  frequency  gain  of  the  controller.  For  the  nominal  design:  p=l  and  y=0.  The  Bode 
plots  for  some  combinations  of  these  two  parameters  are  shown  in  Fig.  19. 
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Bode  Diagram 


io'2  10'1  10°  101 

Frequency  (rad/sec) 


Figure  19:  Frequency  Responses  of  the  Dynamic  Cost  Function  Weightings 
Pc  (solid),  Fc  with  y=0  (dashed),  Fc  with  p=1  (dotted) 

The  parameterization  involves  two  tuning  parameters  and  is  meant  to  simplify  the  design  task.  For 
example,  decreasing  the  value  of  p  (reducing  the  control  weighting)  leads  to  a  faster  response  and  a  more 
violent  control  action.  This  can  also  be  seen  from  the  stochastic  performance  results  (with  added 
disturbance  noise).  Interestingly,  there  is  very  little  change  in  the  error  variance.  On  the  other  hand,  while 
decreasing  p  to  a  value  of  0.35  leads  to  some  undesirable  oscillatory  behaviour,  adding  a  lead  term  helps 
resolve  this  problem.  The  Figure  12  and  Table  2  present  the  simulation  results  for  different  values  of 
p  (y=0).  As  expected  increasing  p.  results  in  a  slower  step  response  characteristic,  providing  a  simple 
tuning  mechanism. 
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Figure  20:  Time  Responses  for  Weighting  Parameters:  p  =  0.5,  0.7, 1 ,  2;  y  =  0 
Table  2:  Stochastic  Performance  for  Weighting  Parameters:  p  =  0.5,  0.7, 1 ,  2;  y  =  0 


Op. 

point 

rho 

Var[e] 

Var[u] 

Var[phi] 

3 

0.5 

0.01528 

0.00334 

0.00263 

0.7 

0.01515 

0.00199 

0.00162 

1 

0.01512 

0.00120 

0.00131 

2 

0.01518 

0.00049 

0.00169 

0 

0.5 

0.01330 

0.00095 

0.00233 

0.7 

0.01324 

0.00059 

0.00126 

1 

0.01322 

0.00035 

0.00077 

2 

0.01312 

0.00013 

0.00055 

-1 

0.5 

0.01042 

0.00345 

0.00352 

0.7 

0.01017 

0.00201 

0.00171 

1 

0.01015 

0.00117 

0.00127 

2 

0.01023 

0.00063 

0.00214 

-3 

0.5 

0.00604 

0.00182 

0.00258 

0.7 

0.00593 

0.00129 

0.00140 

1 

0.00595 

0.00089 

0.00104 

2 

0.00617 

0.00042 

0.00141 

For  comparison,  the  nominal  PID  controller  has  been  retuned  and  its  performance  compared  with  that  of 
the  NGMV  controller  with  design  parameters  p=0.5  and  y=0.3.  A  set  of  PID  parameters  were  obtained 
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that  were  close  to  the  NGMV  design  in  terms  of  the  speed  of  response,  but  the  plant  nonlinearity  still 
caused  some  oscillatory  behaviour  in  the  PID  control  design  responses. 

In  the  last  experiment,  the  plant  time  delay  was  increased  from  4  to  10  samples.  For  the  controller  design, 
the  same  weightings  were  used  as  before  but  the  NGMV  controller  obtained  was  of  course  different, 
reflecting  the  change  in  the  time-delay.  Then  the  Nonlinear  Control  Design  Blockset  of  Matlab  was  used 
to  find  the  optimal  PID  parameters,  given  the  desired  response.  The  boundary  constraints  were  relaxed 
until  a  feasible  set  of  parameters  was  found.  However,  it  was  not  possible  to  tune  the  PID  controller  for 
satisfactory  responses,  across  the  whole  operating  range. 

The  Fig.  21  shows  the  response  of  the  NGMV  controller  and  of  two  of  the  PID  controllers  obtained.  The 
dynamic  response  of  the  NGMV  controller  is  very  close  to  the  original  one,  despite  the  significant  increase 
in  the  time  delay.  It  was  not  possible  to  obtain,  for  the  PID  controllers,  both  fast  transient  responses  at  the 
operating  point  =  3  and  no  oscillatory  behaviour  at  the  operating  point  =  0.  The  PID  controller  did  not 
have  time  delay  compensation,  so  it  might  be  argued  that  it  is  not  a  fair  comparison,  nevertheless  it 
demonstrates  a  potential  of  the  NGMV  controller  to  control  highly  nonlinear  plants  with  significant  time 
delays.  Moreover,  although  the  link  to  a  Smith  Predictor  time  delay  compensator  was  made,  the  approach 
has  the  significant  advantage  over  the  Smith  Predictor,  that  it  provides  a  stochastic  control  design 
procedure,  whereas  the  Smith  Predictor  only  provides  a  structure  (there  is  no  guidance  how  to  design  the 
controller  for  say  disturbance  rejection). 


Output 


Control 


Figure  21:  Time  Responses  for  Time  Delay  Increased  from  4  to  10  Samples 
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The  robustness  of  full-order  and  restricted  structure  optimal  control  problems  was  considered  for 
continuous-time  linear  systems.  The  emphasis  was  on  the  improvement  of  robustness  by  adding  a 
sensitivity  costing  term  in  the  cost  index  and  by  introducing  a  fictitious  disturbance  model  Wp.  The  effect 
of  measurement  noise  was  also  investigated  and  this  was  introduced  in  the  feedback  system  model.  The 
robustness  weighting  acts  directly  on  the  sensitivity  function  and  may  improve  robustness  margins  but 
other  properties  will  probably  deteriorate  like  the  measurement  noise  rejection  properties.  If  robustness  is 
more  important  than  stochastic  properties  then  attention  would  turn  to  I  IX  cost  minimization  and  many  of 
the  ideas  presented  above  would  apply  (Grimble,  1986  [30]).  However,  such  an  approach  is  readily 
embedded  in  the  usual  mixed  sensitivity  Ho,  design  problem. 

A  relatively  simple  controller  for  nonlinear  multivariable  and  possibly  time-varying  systems  was  also 
introduced.  The  closed  loop  stability  of  the  system  was  shown  to  depend  upon  the  existence  of  a  stable 
inverse  for  a  particular  loop  operator.  This  operator  depended  upon  the  cost  weighting  definitions.  It  was 
shown  that  a  possible  starting  point  for  weighting  selection  was  through  the  relationship  to  a  PID 
controller.  That  is,  if  it  is  assumed  that  PID  controller  exists,  to  stabilize  the  delay  free  plant  model  Wk , 
then  this  guaranteed  the  existence  of  at  least  one  set  of  control  weightings  that  would  ensure  closed-loop 
stability. 

A  major  advantage  of  the  NGMV  solution  is  that  the  only  knowledge  of  the  nonlinear  plant  model  that  is 
required,  is  the  ability  to  compute  an  output  for  a  given  control  input  sequence.  Such  a  model,  assumed 
stable,  coidd  be  in  Fortran  or  C  code,  or  might  even  include  look  up  tables  or  a  neural  network.  The 
remaining  computations  concern  the  linear  disturbance  and  reference  signal  models  and  knowledge  of  the 
transport  delay  element  of  length  k.  These  are  representative  linear  approximations  and  experience 
suggests  they  will  be  adequate  so  long  as  they  capture  the  dominant  frequency  response  behaviour.  It 
follows  that  such  a  controller  can  be  calculated  without  the  usual  model  information  required  in  traditional 
model  based  control  law  design. 

The  relationship  to  the  Smith  Predictor  was  discussed  for  two  reasons.  Firstly  the  extension  of  Smith ’s 
ideas  to  the  nonlinear  problem  is  interesting  and  provides  a  practical  method  of  implementing  these 
controllers,  when  the  plant  is  open-loop  stable.  Secondly  the  physical  structure  is  useful  to  provide  an 
intuitive  understanding  of  the  operation  and  properties  of  the  proposed  Nonlinear  GMV  controller.  The 
structure  in  Fig.  14,  that  describes  the  Nonlinear  Smith  Predictor,  is  particularly  illuminating. 
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